trees/subtree: Add MTC subtree primitives by beautifulentropy · Pull Request #8808 · letsencrypt/boulder

beautifulentropy · 2026-06-17T17:01:55Z

Implement the generation and verification of MTC subtree consistency proofs, which prove that an aligned interval of the log (a subtree) belongs to the tree under a given root hash. The MTC draft defines these in section 4.

x/mod's sumdb/tlog provides the hash primitives and the stored-hash reader these functions use. Its proofs cover only two cases: an inclusion proof for a single entry and a consistency proof for a prefix of the log. MTC requires a proof for any subtree; x/mod has no such proof.

Export three functions:

HashLeaves computes an RFC 6962 Merkle Tree Hash over a slice of leaf hashes.
ConsistencyProof generates an MTC Subtree Consistency Proof.
VerifyConsistency verifies an MTC Subtree Consistency Proof against a given root hash.

A note to reviewers: this package and its tests were written with assistance from Claude Opus 4.8 and Fable 5. That being said, it has been reviewed and extensively revised by myself before submission.

jsha

After a first look, my high level comments:

There are a number of places where subtree.go panics. I notice that the upstream tlog.go panics in a bunch of places where the match is wrong too, but I'd like to see if we can have fewer panic sites to reason about. For instance, what if ValidSubtree() returns a Subtree() object, which has accessors for Hi(), Lo(), Size(), and Height(). We can guarantee in the contract for Size() and Height(), for instance, that they are positive (non-negative and non-zero). This might also let us consolidate the various //nolint: gosec // G115 correctness comments into a smaller number of places.

We could make this into a constructor:

type Subtree struct {
  start, len, treeSize uint64
}

func New(start, end, treeSize int64) (Subtree, error) {}

I notice that both internal callers of ValidSubtree() check end > treeSize immediately after. That suggests to me that ValidSubtree() should be a function of not just start and end, but also of tree size, so that we can check that property as part of it.

jsha · 2026-06-25T20:56:26Z

+	return int64(1) << (bits.Len64(uint64(n-1)) - 1) //nolint:gosec // G115: n > 1, so n-1 is positive.
+}
+
+// SubtreeHash returns the RFC 6962 section 2.1 Merkle Tree Hash over leaves


Since a subtree is a Merkle tree, this calculates a Merkle Tree Hash. I think this function is equivalent to tlog.TreeHash and we can reuse that, right?

I note that this function takes a list of leaves in-memory, while tlog.TreeHash takes a HashReader, which generalizes over hash storage. But if we want to use leaves in-memory, we can pass a HashReaderFunc that accesses them.

So, I think we can do without this function.

HashLeaves, renamed from Hash to make its intention clearer, has just one external caller that needs it and it requires this shape. For each uploaded package, the tlog-mirror-srv's add-entries handler hashes the received leaf entries with tlog.RecordHash, combines those leaf hashes into the subtree hash with HashLeaves, and verifies the package's subtree consistency proof against the pending checkpoint's root. At that point the entries are not in any tree and thus not loaded into a reader.

That said, it doesn't necessarily have to live here, this just seemed like the best home for it.

jsha · 2026-06-25T21:19:32Z

+// rangeHash returns MTH(D[lo:hi)), the RFC 6962 section 2.1 Merkle Tree Hash
+// over the leaves in [lo, hi) as an independent list, read through the provided
+// reader. It decomposes [lo, hi) into its maximal aligned perfect subtrees and
+// reads all of their roots in a single ReadHashes call before folding them
+// together.
+func rangeHash(lo, hi int64, reader xtlog.HashReader) (xtlog.Hash, error) {


I think this can also be implemented in terms of tlog.TreeHash(hi-lo, someReader).

The distinction is that TreeHash will make storage calls starting from zero, but we need to make sure our storage layer is correctly indexing based on the subtree we want. I think we can achieve that with a tlog.HashReaderFunc:

func subtreeHash(lo, hi int64, reader xtlog.HashReader) (xtlog.Hash, error) { return tlog.TreeHash(hi-lo, func(zeroBasedIndexes []int64) []Hash { var absoluteIndexes []int64 for _, i := range zeroBasedIndexes { absoluteIndexes = append(absoluteIndexes, i+lo) } return reader.ReadHashes(absoluteIndexes) } }

Using the HashReaderFunc is very clever, but the way you're calculating the offset is subtly wrong. The indexes TreeHash hands our reader aren't leaf numbers. They're stored hash indexes, a flat numbering of every node in the tree, with leaves and internal nodes interleaved. So i + lo is adding a leaf count to something that isn't a leaf position. The offset has to account for the node's level. A leaf may be lo away from another leaf, but a node spanning 8 leaves moves by only lo / 8.

So instead of offsetting the index, we decode it, shift its position by lo scaled to that level, and re-encode:

level, n := tlog.SplitStoredHashIndex(idx) absolute := tlog.StoredHashIndex(level, (lo>>level)+n)

Which gives us:

func rangeHash(lo, hi int64, reader tlog.HashReader) (tlog.Hash, error) { return tlog.TreeHash(hi-lo, tlog.HashReaderFunc( func(zeroBasedIndexes []int64) ([]tlog.Hash, error) { absoluteIndexes := make([]int64, len(zeroBasedIndexes)) for i, idx := range zeroBasedIndexes { level, n := tlog.SplitStoredHashIndex(idx) absoluteIndexes[i] = tlog.StoredHashIndex(level, (lo>>level)+n) } return reader.ReadHashes(absoluteIndexes) })) }

But fixing the offset surfaces our next problem. lo>>level is only exact when lo is a multiple of 2^level, which holds for every range we currently pass to rangeHash(). If a later change passes a lo that isn't, the shift drops bits, the index points to a different node, and we silently fold a wrong hash.

So we would need to add a guard:

func rangeHash(lo, hi int64, reader tlog.HashReader) (tlog.Hash, error) { return tlog.TreeHash(hi-lo, tlog.HashReaderFunc( func(zeroBasedIndexes []int64) ([]tlog.Hash, error) { absoluteIndexes := make([]int64, len(zeroBasedIndexes)) for i, idx := range zeroBasedIndexes { level, n := tlog.SplitStoredHashIndex(idx) loAtLevel := lo >> level if loAtLevel<<level != lo { return nil, fmt.Errorf("subtreeHash: lo=%d is not a multiple of 2^%d", lo, level) } absoluteIndexes[i] = tlog.StoredHashIndex(level, loAtLevel+n) } return reader.ReadHashes(absoluteIndexes) })) }

For what it's worth, your idea does work. That said, the version we have doesn't require this awkward translation layer that's somewhat difficult to reason about.

aarongable · 2026-06-26T20:07:03Z

+// that the leaves are an aligned subtree.
+//
+// https://datatracker.ietf.org/doc/html/rfc9162#section-2.1.1
+func Hash(leaves []tlog.Hash) tlog.Hash {


This API shape is surprising to me. Being willing to hash an arbitrary set of leaves a) feels like something to be left for a general package (like xtlog itself), not subtree; and b) feels dangerous! What if the caller doesn't respect the contract that they need to make sure it's a valid subtree first?

I think this package should only expose func SubtreeHash(start, end int64, reader tlog.HashReader) tlog.Hash. It can verify that the interval is a valid subtree, then read all the necessary hashes from the reader itself. This guarantees that we never supply an invalid set of input hashes.

I think the existing API of this function might be acceptable as a private helper, but not as part of the public interface of the package.

HashLeaves, renamed from Hash to make its intention clearer, has just one external caller that needs it and it requires this shape. For each uploaded package, the tlog-mirror-srv's add-entries handler hashes the received leaf entries with tlog.RecordHash, combines those leaf hashes into the subtree hash with HashLeaves, and verifies the package's subtree consistency proof against the pending checkpoint's root. At that point the entries are not in any tree and thus not loaded into a reader.

That said, it doesn't necessarily have to live here, this just seemed like the best home for it.

aarongable · 2026-06-26T22:34:45Z

+	indexes := perfectSubtreeIndexes(start, end, nil)
+	hashes, err := reader.ReadHashes(indexes)
+	if err != nil {
+		return tlog.Hash{}, err
+	}
+	if len(hashes) != len(indexes) {
+		// Reader returned a slice shorter or larger than the requested indexes.
+		// Avoid panicking in combineIntervalHash.
+		return tlog.Hash{}, fmt.Errorf("ReadHashes returned %d hashes for %d indexes", len(hashes), len(indexes))
+	}
+	h, _ := combineIntervalHash(start, end, hashes)
+	return h, nil


This feels too clever, as though you're optimizing for a case we don't actually have (and don't want to support). Even if we do want to compute hashes of arbitrary intervals (which as I note above, I don't think we do -- I'm pretty sure all of our inputs to this function are valid subtrees), the MTC draft shows that any arbitrary interval can be decomposed into two subtrees: a perfect left subtree, and a ragged right subtree.

This function's approach is to do a big recursive "find all the indices", followed by a second recursive "combine the hashes". But the whole thing can be done in a single, much-more-simply recursive "look up hash of left perfect tree, combine with recursive computation of hash of right ragged tree".

You're right, all of the inputs to this function are valid subtrees. But this isn't meant to support arbitrary intervals. tlog.HashReader is attached to our tile store (S3/MinIO) and each ReadHashes call is a round trip. A single recursion would read one hash per subtree resulting in about log N round trips. Gathering the indices up front lets us do it in a single batched read, which is the only reason we build the list of indices first.

tlog: Add MTC subtree generation and verification

b2597ca

beautifulentropy changed the title ~~tlog: Add MTC subtree generation and verification~~ tlog: Add MTC subtree primitives Jun 17, 2026

beautifulentropy marked this pull request as ready for review June 17, 2026 18:58

beautifulentropy requested a review from a team as a code owner June 17, 2026 18:58

beautifulentropy requested a review from aarongable June 17, 2026 18:58

style: split tuple assignments

bcc864b

beautifulentropy requested a review from jsha June 22, 2026 17:46

jsha reviewed Jun 25, 2026

View reviewed changes

Comment thread trees/subtree/subtree.go Outdated

Comment thread trees/subtree/subtree.go Outdated

Comment thread tlog/subtree.go Outdated

jsha reviewed Jun 25, 2026

View reviewed changes

beautifulentropy added 3 commits June 26, 2026 11:39

Move from tlog to subtree package in trees, combine test helpers

55f815c

Merge remote-tracking branch 'origin/main' into tlog-subtree

63e9390

Bring terminology closer to the MTC draft and RFC 9162

515bd51

jsha mentioned this pull request Jun 26, 2026

trees/cosigned: Add cosigned.Message #8819

Merged

aarongable reviewed Jun 26, 2026

View reviewed changes

beautifulentropy added 4 commits June 29, 2026 12:24

"Complete" rename + comment update

162d91f

Address comments

70f2c8d

Add splitpoint

9d3f684

Use absolute coordinates [windowStart, windowEnd) in subtreeSubProof

e5854b6

beautifulentropy requested review from aarongable, ezekiel and jsha June 30, 2026 15:09

beautifulentropy changed the title ~~tlog: Add MTC subtree primitives~~ trees/subtree: Add MTC subtree primitives Jun 30, 2026

Uh oh!

Uh oh!

Conversation

beautifulentropy commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jsha left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jsha Jun 25, 2026

Choose a reason for hiding this comment

Uh oh!

beautifulentropy Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

jsha Jun 25, 2026

Choose a reason for hiding this comment

Uh oh!

beautifulentropy Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

aarongable Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

beautifulentropy Jun 29, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aarongable Jun 26, 2026

Choose a reason for hiding this comment

Uh oh!

beautifulentropy Jun 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

beautifulentropy commented Jun 17, 2026 •

edited

Loading

beautifulentropy Jun 29, 2026 •

edited

Loading